Let be a β-smooth and α-strongly convex function. If we run GD for steps (with step size ) we have:
is called the condition number of
compare: Gradient descent convergence for β-smooth functions, Gradient descent convergence for α-strongly convex functions